Integration of Heteregeneous Bio-Medical Databases: A Federated Approach using Semantic Schemas
نویسندگان
چکیده
Biomedical experiments generate a vast amount of data that needs to be organized, integrated and analyzed. Important research challenges in the retrieval of these data include flexible, integrated analysis of data held in existing heterogeneous data sources. Indeed, the problems of supporting ad hoc queries across multiple data sources and correlating the data retrieved pose a host of challenging research problems. Our approach to integrate data from heterogeneous databases is to build a federated database system with a centralized mediator. We build a shared global schema and the mappings between the schemas are captured using rules. This paper focuses on issues concerning manipulation of large volumes of biomedical data in centralized, distributed or heterogeneous environments. We develop new computer science approaches to managing biomedical data, building on major biomedical informatics initiatives at Yale including over a decade of research performed as part of the national Human Brain Project. Both the functionality and performance of our system are being tested with data from the SenseLab database, CoCoDat database, and Cell Centered Database. ∗This work is partially supported by NSF grant 0331548. This work is also supported by NIH grant P01 DC04732. †Department of Computer Science, Yale University, CT ‡Department of NeuroBiology, Yale University, CT §Yale Center for Medical Informatics, and Department of Anesthesiology, Yale University, CT ¶Department of Computer Science, Yale University, CT ‖Center for Medical Informatics, Yale University, CT
منابع مشابه
A Negotiation Process Approach for Building Federated Databases
The negotiation process is often referred to in the literature on federated databases, but is seldom covered in depth. This process is essential to determine data of the component schema to be integrated for building a federated schema and the access permissions to be granted. This paper presents our negotiation process approach which is incorporated in the integration schemas mechanism, so we ...
متن کاملA Service-based Approach to Schema Federation of Distributed Databases
In the last few years, we have witnessed a rapid growth in distributed database processing. We consider the question of data integration: how we can integrate distributed schemas into a new one and query just that new schema without losing the ability to retrieve data from the original schemas. The area in which we try to answer that question is federated databases, where the original heterogen...
متن کاملResolving Semantic Heterogeneity in Databases with a Terminological Model: Correspondence Refinement
The success of schema integration in multidatabase systems relies heavily on the determination of complete and refined correspondence relationships between them. So, the candidate schemas to be integrated must be rich and precise semantically, i.e., each of their data elements must be sufficiently defined for to be distinguished from others or identified to some of them. Our schema integration ...
متن کاملBuilding Secure Data Warehouse Schemas from Federated Information Systems
Introduction There are many heterogeneities among preexisting informations sources, such as DataBases (DBs), particularly semantic heterogeneities (see our chapter “Semantic Heterogeneity” in [1]). Any secure Federated Information Systems (FIS) must solve them, and deal with a): how schema levels are related to security, and b): operations on schemas to convert from level to level. We present t...
متن کاملReconciling Equational Heterogeneity within a Data Federation
Mappings in most federated databases are conceptualized and implemented as black-box transformations between source schemas and a federated schema. This approach does not allow specific mappings to be declared once and reused in other situations. We present an alternative approach, in which data-level mappings are represented independent of source and federated schemas as a network between “con...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006